Yan Li

University of Minnesota

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

Jun 01, 2026
Via arXiv

EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization

May 28, 2026
Via arXiv

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

May 25, 2026
Via arXiv

Reasoning to Align: Implicit Reasoning in Diffusion Transformers for Video Editing

May 23, 2026
Via arXiv

Multimodal LLMs under Pairwise Modalities

May 20, 2026
Via arXiv

A Dialogue between Causal and Traditional Representation Learning: Toward Mutual Benefits in a Unified Formulation

May 20, 2026
Via arXiv

LiWi: Layering in the Wild

May 14, 2026
Via arXiv

From Trajectories to Phenotypes: Disease Progression as Structural Priors for Multi-organ Imaging Representation Learning

May 12, 2026
Via arXiv

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

May 12, 2026
Via arXiv

Fashion130K: An E-commerce Fashion Dataset for Outfit Generation with Unified Multi-modal Condition

May 11, 2026
Via arXiv